478 research outputs found

    Evaluation of an Algorithm for the Automatic Detection of Salient Frequencies in Individual tracks of Multi-track Musical Recording

    Get PDF
    This paper evaluates the performance of a salient frequency detection algorithm. The algorithm calculates each FFT bin maximum as the maximum value of that bin across an audio region and identifies the FFT bin maximum peaks with the highest five deemed to be the most salient frequencies. To determine the algorithm’s efficacy test subjects were asked to identify the salient frequencies in eighteen audio tracks. These results were compared against the algorithm’s results. The algorithm was successful with electric guitars but struggled with other instruments and in detecting secondary salient frequencies. In a second experiment subjects equalised the same audio tracks using the detected peaks as fixed centre frequencies. Subjects were more satisfied than expected when using these frequencie

    An investigation into the efficacy of methods commonly employed by mix engineers to reduce frequency masking in the mixing of multitrack musical recordings

    Get PDF
    Studio engineers use a variety of techniques to reduce frequency masking between instruments when mixing multi-track musical recordings. This study evaluates the efficacy of three techniques, namely mirrored equalization, frequency spectrum sharing and stereo panning, against their variations to confirm the veracity of accepted practice. Mirrored equalisation involves boosting one instrument and cutting the other at the same frequency. Frequency spectrum sharing involves low pass filtering one instrument and high pass filtering the other. Panning involves placing two competing instruments at different pan positions. Test subjects used eight tools comprising a single unlabeled slider to reduce frequency masking in several two instrument scenarios. Satisfaction values were recorded. Results indicate subjects preferred using tools that panned both audio tracks

    A linear noise approximation for stochastic epidemic models fit to partially observed incidence counts

    Get PDF
    Stochastic epidemic models (SEMs) fit to incidence data are critical to elucidating outbreak dynamics, shaping response strategies, and preparing for future epidemics. SEMs typically represent counts of individuals in discrete infection states using Markov jump processes (MJPs), but are computationally challenging as imperfect surveillance, lack of subject-level information, and temporal coarseness of the data obscure the true epidemic. Analytic integration over the latent epidemic process is impossible, and integration via Markov chain Monte Carlo (MCMC) is cumbersome due to the dimensionality and discreteness of the latent state space. Simulation-based computational approaches can address the intractability of the MJP likelihood, but are numerically fragile and prohibitively expensive for complex models. A linear noise approximation (LNA) that approximates the MJP transition density with a Gaussian density has been explored for analyzing prevalence data in large-population settings, but requires modification for analyzing incidence counts without assuming that the data are normally distributed. We demonstrate how to reparameterize SEMs to appropriately analyze incidence data, and fold the LNA into a data augmentation MCMC framework that outperforms deterministic methods, statistically, and simulation-based methods, computationally. Our framework is computationally robust when the model dynamics are complex and applies to a broad class of SEMs. We evaluate our method in simulations that reflect Ebola, influenza, and SARS-CoV-2 dynamics, and apply our method to national surveillance counts from the 2013--2015 West Africa Ebola outbreak

    Efficient data augmentation for fitting stochastic epidemic models to prevalence data

    Full text link
    Stochastic epidemic models describe the dynamics of an epidemic as a disease spreads through a population. Typically, only a fraction of cases are observed at a set of discrete times. The absence of complete information about the time evolution of an epidemic gives rise to a complicated latent variable problem in which the state space size of the epidemic grows large as the population size increases. This makes analytically integrating over the missing data infeasible for populations of even moderate size. We present a data augmentation Markov chain Monte Carlo (MCMC) framework for Bayesian estimation of stochastic epidemic model parameters, in which measurements are augmented with subject-level disease histories. In our MCMC algorithm, we propose each new subject-level path, conditional on the data, using a time-inhomogeneous continuous-time Markov process with rates determined by the infection histories of other individuals. The method is general, and may be applied, with minimal modifications, to a broad class of stochastic epidemic models. We present our algorithm in the context of multiple stochastic epidemic models in which the data are binomially sampled prevalence counts, and apply our method to data from an outbreak of influenza in a British boarding school

    An Investigation into the Relationship Between the Subjective Descriptor Aggressive and the Universal Audio 1176 FET Compressor

    Get PDF
    In popular music productions, the lead vocal is often the main focus of the mix and engineers will work hard to impart creative colouration on this source. This paper conducts listening experiments to test if there is a correlation between perceived distortion and the descriptor “aggressive” which is often used to describe the sonic signature of the Universal Audio 1176. The results from this study show compression settings that impart audible distortion are perceived as aggressive by the listener and. Furthermore, there is a strong correlation between the subjective scores for distortion and aggressive. eIt was also shown that there is a strong correlation between compression settings measured rated to have 0.5% THD and above were rated as both the most distorted and most aggressive. high aggressive scores and the audio feature roughness

    Exploring the Container Metaphor for Equalisation Manipulation

    Get PDF
    This paper presents the first stage in the design and evaluation of a novel container metaphor interface for equalisation control. The prototype system harnesses the Pepper's Ghost illusion to project mid-air a holographic data visualisation of an audio track's long-term average and real-time frequency content as a deformable shape manipulated directly via hand gestures. The system uses HTML 5, JavaScript and the Web Audio API in conjunction with a Leap Motion controller and bespoke low budget projection system. During subjective evaluation users commented that the novel system was simpler and more intuitive to use than commercially established equalisation interface paradigms and most suited to creative, expressive and explorative equalisation tasks

    Audio interfaces should be designed based on data visualisation first principles

    Get PDF
    Audio mixing interfaces (AMIs) commonly conform to a small number of paradigms. These paradigms have significant shortcomings. Data visualisation first principles should be employed to consider alternatives. Existing AMI paradigms are discussed and concepts of image theory and elementary perceptual elements outlined. AMIs should be evaluated by usability experiments however performing these properly is time-consuming. There are many data visualisation options and combinations. Collaboration with others would enable a greater range to be explored. Better understanding data visualisation will benefit audio and music interface development in general

    Novel Designs for the Audio Mixing Interface Based on Data Visualisation First Principles

    Get PDF
    Given the shortcomings of current audio mixing interfaces (AMIs) this study focuses on the development of alternative AMIs based on data visualisation first principles. The elementary perceptual tasks defined by Cleveland informed the design process. Two design ideas were considered for pan: using the elementary perceptual tasks ‘scale’ to display pan on either a single or multiple horizontal lines. Four design ideas were considered for level: using ‘length’, ‘area’, ‘saturation’ or ‘scalable icon’ for visualisation. Each level idea was prototyped with each pan idea, totalling eight novel interfaces. Seven subjects undertook a usability evaluation, replicating a 16 channel reference mix with each interface. Results showed that ‘scalable icons’, especially on multiple horizontal lines appear to show potential

    Formal usability evaluation of audio track widget graphical representation for two-dimensional stage audio mixing interface

    Get PDF
    The two-dimensional stage paradigm (2DSP) has been suggested as an alternative audio mixing interface (AMI). This study seeks to refine the 2DSP by formally evaluating graphical track visualisation styles. Track visualisations considered were text only, circles containing text, individually coloured circles containing text, circles colour coded by instrument type with text, icons with text superimposed, circles with RMS related dynamic opacity and a traditional AMI. The usability evaluation focused on track selection efficiency and included user visualisation preference for this micro-task. Test subjects were instructed to click five randomly selected tracks for a six, sixteen and thirty-two track mix for each visualisation. The results indicate text only visualisation is best for efficiency however test subjects preferred icons and traditional AMI
    corecore